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PREDICTING  MAGAZINE  AUDIENCES  WITH  A  LOGLINEAR  MODEL 


Abstract 

A  loglinear  model  for  predicting  magazine  exposure  distributions  is  developed  and  it’s 
parameters  are  estimated  by  using  the  maximum  likelihood  technique.  The  accuracy  of 
the  loglinear  and  a  Dirichlet-multinomial  model  are  compared  using  1985  AGB:  McNair 
data.  The  results  show  that  the  loglinear  model  has  significantly  smaller  prediction  errors 
than  the  Dirichlet-multinomial  model.  A  simple  algorithm  for  optimal  media  scheduling 
is  given. 

Introduction 


The  annual  advertising  budget  for  Anheuser-Busch  Incorporated  in  1983  was 
$290,616,400  (Advertising  Age  1984).  Having  spent  such  am  enormous  amount  of  money 
on  advertising,  the  company  wants  to  be  certain  their  money  is  being  spent  efficiently. 
Their  advertising  money  is  not  spent  efficiently  if  they  have  poor  estimates  of  the  potential 
audience  for  their  advertising  campaigns.  An  underestimate  of  the  audience  does  not  reflect 
the  actual  efficiency  of  the  advertising  campaign;  instead,  a  reduced  expenditure  may  still 
achieve  the  advertiser’s  desired  audience.  An  improvement  in  the  audience  estimates  for 
Anheuser-Busch,  reducing  the  annual  advertising  budget  by  only  1%,  would  amount  to  a 
saving  of  $2,906,164. 

Most  ad  agencies  and  advertisers  use  the  exposure  distribution  (e.d.)  to  assess  cam¬ 
paign  efficiency.  The  e.d.  is  the  proportion  of  the  target  population  which  sees  none,  one, 
two,  or  up  to  all  of  the  ads  in  the  campaign.  In  turn,  the  e.d.  is  used  to  estimate  reach, 
the  proportion  of  the  population  which  is  exposed  to  at  least  one  insertion,  frequency,  the 
proportion  of  the  population  which  is  exposed  to  at  least  one,  at  least  two,  or  up  to  all  of 
tl>'  ads,  and  effective  reach,  the  mean  of  the  e.d.  Effective  reach  is  also  known  as  “Gross 
Rating  Points”  (Naples  1979). 


Many  methods  have  been  employed  to  estimate  the  e.d.  However,  the  majority  are  ad 
hoc  and  lack  accuracy  over  a  range  of  advertising  campaigns  (Chandon  1976).  A  survey 
of  media  directors  from  288  advertising  agencies  in  the  U.S.  showed  that  media  directors 
place  “more  sophisticated  reach  analyses”  high  on  their  list  of  important  future  media 
research  services  (Russell  and  Martin  1980).  This  finding  was  corroborated  by  a  more 
recent  survey  (Leckenby  and  Kishi  1982b)  which  showed  that  90%  of  media  directors  from 
the  100  largest  advertising  agencies  believe  that  between  “some”  and  a  “great  deal”  of 
improvement  is  needed  in  e.d.  models.  The  same  survey  indicated  that  only  about  one- 
third  of  the  directors  perceive  that  the  models  they  currently  use  estimate  the  observed 
e.d.  to  within  five  percent. 

Another  important  use  of  e.d.  estimates  is  in  media  selection  (Aaker  1975;  Lee  1962, 
1963;  Little  and  Lodish  1969).  All  advertising  campaigns  have  a  budget.  It  usually  costs 
more  to  advertise  in  magazines  with  high  readership.  An  advertiser  can  achieve  a  high 
reach  by  placing  many  ads  in  a  magazine  with  a  high  readership  although  the  campaign  cost 
will  be  high  also.  Alternatively  it  is  usually  possible  to  achieve  high  reach  by  placing  ads  in 
a  range  of  magazines,  with  more  ads  in  lower  readership  magazines,  thereby  reducing  the 
overall  campaign  cost.  Accurate  reach  estimates  are  needed  to  juggle  insertion  placement 
in  a  variety  of  magazines  so  as  to  maximize  the  reach  or  effective  reach,  whilst  keeping 
within  the  budget. 

The  first  known  attempt  to  estimate  reach  for  a  media  schedule  was  by  Agostini 
(1961).  However,  Agostini’s  method  was  not  successful  when  applied  to  British  magazines 
(Metheringham  1964).  Metheringham’s  (1964)  own  method  used  the  beta-binomial  distri¬ 
bution  (BBD)  (Skellam  1948;  Ishii  and  Hayakawa  1960).  The  BBD  has  also  been  used  as 
an  e.d.  model  by  Greene  and  Stock  (1967),  Liebman  and  Lee  (1974)  and  Chandon  (1976). 
In  addition  the  BBD  has  been  applied  to  TV  schedules  (Headen,  Klompmaker  and  Teel 
1977;  Rust  and  Klompmaker  1981),  consumer  purchasing  behavior  (Chatfield  and  Good- 


hardt  1970;  Morrison  1979),  household  disease  distributions  (Griffiths  1973),  proportions 
with  extraneous  variance  (Kleiman  1973)  and  as  an  indicator  of  TV  loyalty  (Sabavala  and 
Morrison  1977).  However,  Metheringham’s  method  was  shown  to  overestimate  reach  and 
odd-numbered  exposures  (Schreiber  1969;  Chandon  1976).  His  method  also  fails  to  es¬ 
timate  the  characteristic  shape  of  the  e.d.  (Liebman  and  Lee  1974;  Leckenby  and  Kishi 
1982a). 

Chandon  (1976)  made  an  empirical  comparison  of  six  ad  hoe ,  ten  stochastic  and  three 
simulation  methods  of  estimating  the  e.d.  His  study  showed  that  two  of  the  better  methods 
used  Waring’s  formula  (cf.  Feller  1969)  to  estimate  terms  in  the  formula  where  triplicate 
and  higher  order  probabilities  were  needed.  One  method  was  introduced  by  Kwerel  (1964) 
and  the  other  by  Hofmans  (1966).  An  empirical  study  by  Leckenby  and  Kishi  (1982a) 
showed  that  Hofmans’  method  is  superior  to  Kwerel’s. 

A  model  currently  popular  for  estimating  the  e.d.  is  based  on  the  Dirichlet-  multi¬ 
nomial  distribution  (DMD)  (Chandon  1976;  Leckenby  1981;  Leckenby  and  Kishi  1982a, 
1984).  The  DMD  has  also  been  applied  to  brand  choice  models  (Goodhardt,  Ehrenberg 
and  Chatfield  1984),  mixed  media  models  (Rust  and  Leone  1982,  1984)  and  to  pollen 
counts  (Mosimann  1962).  The  best  known  method  to  date  for  estimating  the  e.d.  is  a 
DMD  model  of  Leckenby  and  Kishi’s  (1984)  which  uses  Hofman’s  geometric  distribution 
to  estimate  the  between-vehicle  duplication.  However,  the  major  deficiency  with  Leckenby 
and  Kishi’s  model  is  that  it  is  designed  primarily  for  schedules  with  equal  insertions  in  all 
the  magazines,  but  equal  insertion  placement  infrequently  occurs  in  practice.  Leckenby 
and  Kishi  state  that  their  model  can  be  used  for  unequal  insertions  by  “treating  multi¬ 
ple  insertions  in  a  vehicle  as  single  insertions  in  multiple  identical  vehicles.”  The  danger 
with  this  procedure  is  that  it  ignores  the  within-vehicle  duplication,  a  factor  known  to  be 
important  due  to  reading  loyalty. 

We  propose  to  develop  three  models  to  allow  for  e.d.s  with  one,  two,  and  three  or  more 
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magazines.  The  models  build  on  each  other  in  that  the  model  for  one  magazine  is  used  to 
improve  the  fit  of  the  model  for  two  magazines  and  the  model  for  two  magazines  is  used 
to  estimate  the  parameters  of  the  model  for  three  or  more  magazines.  In  contrast  to  some 
past  e.d.  models  we  will  use  rigorous  statistical  methodology  in  developing  the  models  for 
magazine  e.d.s  in  addition  to  employing  statistically-efficient  parameter  estimation. 

The  model  for  one  magazine  introduces  an  additional  parameter  to  the  BBD  to  account 
for  reading  loyalty.  This  simple  generalization  of  the  BBD  was  first  suggested  by  Chandon 
(1976)  although  his  method  of  parameter  estimation  was  naive,  sometimes  leading  to 
inconsistencies  in  his  estimates.  We  will  improve  upon  his  estimation  method. 

The  model  for  two  magazines  uses  the  DMD  to  model  a  bivariate  magazine  e.d.  The 
model  is  capable  of  handling  unequal  insertions  and,  again,  we  will  use  efficient  parameter 
estimation. 

The  model  for  three  or  more  magazines  uses  a  loglinear  model  to  estimate  the  cells 
of  a  multidimensional  contingency  table.  Bishop,  Fienberg  and  Holland  (1975)  give  a 
full  account  of  the  derivation  and  application  potential  for  loglinear  models.  We  make  a 
reasonable  simplifying  assumption  for  the  loglinear  model  from  which  it  follows  that  only 
bivariate  data  are  needed  to  fit  the  model.  Thus  our  model,  for  it’s  improved  accuracy, 
uses  only  as  much  data  as  existing  models. 

Our  three  models  are  empirically  tested  against  the  best  currently-known  models  using 
sample  exposure  data  from  the  AGB:McNair  “National  Media  Survey”  conducted  in  1985. 
Finally  we  give  a  simple  algorithm  for  which  our  e.d.  models  can  be  used  to  maximize 
either  reach  or  effective  reach  while  keeping  within  a  campaign  budget. 

The  Models 

One-Magazine  Model 

Many  people  subscribe  to  one  or  more  magazines.  Among  them  is  a  proportion  which 
always  reads  a  particular  magazine. 


Chandon  (1976)  suggested  a  modification  of  the  BBD  which  he  called  the  “two  segment 
beta-binomial  model”.  One  segment  is  definite  readers  and  the  other  is  probable  readers. 
They  represent  proportions  u>  and  l-u>  respectively.  The  parameter  u>  may  be  viewed  as 
a  loyalty  factor.  A  high  value  of  u>  indicates  an  appreciable  reading  loyalty  whilst  a  low 
value  indicates  little  or  no  loyalty  to  a  particular  magazine. 

By  mixing  this  loyalty  proportion  with  the  mass  function  of  the  BBD  we  obtain  the 
modified  BBD  (MBBD).  Let  X  be  the  number  of  exposures  a  person  has  to  k  insertions 
in  a  single  magazine.  The  mass  function  of  the  MBBD  is 


/*»(. x  =  X)  =  (1  -  U)(k)  rta  +  '?l,rt*-;  +  '>)r(f  +  g)  +  urt(X-M  , 

(1)  v  '  v  '  \xj  r(a  +  0  +  h)  T{/3)  T(a)  1  x  k}  ' 

x  —  0, . . .  ,k, 


where  T(/  +  1)  =  IT(1),  the  usual  gamma  fuction,  and  I{x=*}  is  1  when  X  =  k  and  0 


otherwise.  Note  that  when  w=0  the  MBBD  reduces  to  the  BBD. 


Chandon  (1976)  estimated  a,  (3  and  w  by  equating  the  sample  proportion  of  non- 
readers  to  the  proportion  of  nonreaders  given  by  the  MBBD  model  for  k  =  1,2,3.  These 
data  cannot  be  obtained  from  our  sample  data  (cf.  Ql  and  Q2  below).  Chandon ’s  system 
of  three  equations  sometimes  results  in  an  inconsistent  solution  which  forces  him  to  set 
cj  =  0,  thereby  losing  any  advantage  of  using  the  MBBD. 

We  use  maximum  likelihood  estimation  to  estimate  a,  /?  and  w.  The  likelihood 
equations  are  given  in  the  technical  appendix.  Danaher  (1987)  proved  that  the  maximum 
likelihood  estimates  (MLEs)  of  a,  /?  and  u  are  best  asymptotically  normal  (BAN),  which 
means  that  as  the  sample  size  (n)  tends  to  infinity  these  estimates  tend  to  their  true  values 
and  these  estimates  have  the  smallest  variance  among  all  estimates.  Serfiing  (1980,  p.  142) 
gives  a  formal  definition  of  this  concept. 


Two- Magazine  Model 


Let  Y\  =  the  number  of  exposures  exclusive  to  magazine  1,  Y2  =  the  number  of 
exposures  exclusive  to  magazine  2,  and  Y$  =  the  number  of  exposures  to  both  magazines  1 


and  2,  with  0  <  Y{  <  k  ,  i  =  1,2,3,  i.e.,  initially  there  are  k  insertions  in  both  magazines. 
Let  Y={YuY2,Ys)-t  then  the  DMD  has  mass  function  (Mosimann  1962) 


/“(Y  =  y)  = 


r(r)  r(*  -  EL,  i,.  + -ro)  a  rto  + ->■) 


(*  -  ELi  is)!  r(r  + 


r(-ro) 


n*  w  n) 


>  0,  r  '  li i  0  —  Vi  —  k,  i  —  1,2,3,  ^  ^  y,  ^  fc. 


Now  let  Xi  =  Yi  +  F3  and  X2  =  Yh  +  Vs!  then  (Xi,X2)  is  the  bivariate  e.d.  of  the 
numbers  of  exposures  to  magazines  1  and  2  with  mass  function, 


0Xi,x*(-Xi  =Xi,X2  =  x2)  = 


k\T(r) 


min{x  i,ij} 

E 


r(r  +  k)  ^ 

—  Jc} 


r(j!  -  x3  +  Ki)r(x2  -  x3  +  72)T(z3  +  73)r(A:  +  x3-i1-x2  +  7o) 

(xi  -  x3)l(x2  -  x3)!x3!(A:  +  x3  -  xr  -  x2)l  n?=0  r(7t) 

0  <  x,-  <  &,  *  =  1,2. 

At  this  stage  we  have  a  model  for  a  bivariate  e.d.  when  there  are  equal  insertions 
in  both  magazines.  If  the  media  schedule  requires  unequal  insertions  then  add  dummy 
insertions  to  the  magazine  with  the  lesser  insertions  to  get  insertion  equality.  This  method 
of  adding  dummy  insertions  was  used  by  Rust  and  Leone  (1984)  for  two  different  media, 
viz.,  TV  and  magazines.  Following  the  construction  of  the  bivariate  e.d.  using  (2)  a 
hypergeometric  adjustment  is  made  (Chandon  1976;  Rust  and  Leone  1984)  as  follows. 
Suppose  k{  insertions  are  placed  in  magazine  i,  i  =  1,2.  When  ki  >  k2, 


(3)  ^x;,X5(*i»*2)  =  E  *?  gxltx a(si>*)  »  0<  Xj  <  ,  *  =  1,2. 

3  =  XJ  \  M  ) 

Now  hxj,x;  >s  the  bivariate  e.d.  model  when  the  insertions  are  unequal  and  gxi,x7 
comes  from  (2). 

The  final  step  in  constructing  the  bivariate  e.d  model  is  to  adjust  (3)  so  that  it  has 
MBBD  marginals  from  (1)  by  using  iterated  proportions  (Deming  and  Stephan  1940). 


To  estimate  the  parameters  of  (3)  we  will  use  the  data  from  Ql  below  where  k  =  1 
and  denote  it  {nXlXi},  where  nXlXa  is  the  number  of  people  in  the  sample  who  have  xx 
exposures  to  magazine  t,  i  =  1,2  (0  <  £,•  <  1),  and  n  is  the  sample  size.  The  problem  now 
is  that  we  have  only  three  linearly  independent  data  (since  n  =  £  Xj  nXlXa)  and  four 
parameters  to  estimate. 

The  MLEs  of  the  7,-  are 

A  ^00  A  A  n  10.  A  ^01  A  A  «U  A 

4)  70  =  - r,  71  = - r,  72  =  - r,  73  =  - r  . 

n  n  n  n 

From  (4)  it  can  be  seen  that  in  order  to  estimate  the  7 »  ,*  =  0, 1,2,3,  it  is  necessary  to 
estimate  t  also.  Basu  and  de  B.  Pereira  (1982)  prove  that  Xl  and  X2  have  beta-binomial 
distributions  with  parameters  a,-  and  /?,•  defined  as  follows 

7i+Tf3  =  0£i,  7o+'n  =  Pu 

(5) 

72  +  73  =  <*2  .  Tfo  +  'll  =  02  • 

A 

Let  &i  and  Pi  be  MLEs  or  method  of  moments  estimates  of  a*  and  $  =  1,2.  From 
(5),  a,  +  Pi  =  70  +  7i  +  72  +  73  =  t,  i  =  1,2.  However,  in  general  <*1  +  Pi  ^  a2  +  p2 

A  A 

so  that  some  authors  take  a  weighted  average  of  07  +  Pi  and  a2  +  P2  to  estimate  r  with 
—  where  Wi  =  ,  *  =  1,2  (Chandon  1976,  Leckenby  and  Kishi  1984, 

Rust  and  Leone  1984).  We  found  this  unappealing  since  this  estimator  of  r  is  rather  ad 
hoc.  The  authors  could  equally  well  have  chosen  the  arithmetic,  geometric,  or  harmonic 
mean  of  (d*  +  pi),  i  =  1,2.  In  addition  this  estimate  is  usually  inconsistent,  i.e.,  as  the 
sample  size  tends  to  infinity,  the  weighted  average  estimate  of  r  above  does  not  tend  to  r 
(Danaher  1987). 


Our  estimate  of  r  is 


.  ( 
r  =  nl 


_ Q=l4lQ!2/?2 _ ' 

(nio  +  nn)(noo  +  ^oi)(»»oi  +  nn)(noo  +  ^io) , 


This  estimate  is  numerically  close  to  the  geometric  mean  of  (a<  +  Pi),  i  =  1,2.  This 
estimate  is  consistent  and  after  (3)  is  adjusted  to  conform  to  MBBD  marginals,  by  iterated 


I  *'#'!**  (*• 


proportional  fitting,  the  estimate  of  the  bivariate  e.d.  obtained  by  substituting  (4)  and  (6) 
into  (3)  is  BAN  (see  Danaher  1987). 

Model  for  Three  or  More  Magazines 

Here  we  construct  a  loglinear  model  to  estimate  the  m-dimensional  probabilities  for  an 
m- variate  e.d.  To  fix  ideas  we  will  examine  the  case  m  =  3  and  estimate  the  3-dimensional 
e.d.  of  (Xi, X2, X3),  where  X}  is  the  number  of  exposures  a  person  has  to  magazine 
J,  j  =  1, 2, 3. 

Using  the  notation  of  Fienberg  (1977)  the  general  loglinear  model  for  three  dimensions 


log  mijk  —  u  +  +  U2(j)  +  U3(jfc)  -1-  Ut2(»y)  +  Ul3(»fc)  +  U23(jJk)  +  Ui23(»jA:)  , 

(7) 

*  =  0, 1 . . . ki,  y  =  0,1,...*2,  k  —  0, 1, . . .  , 

where  mi]k  =  the  expected  number  of  people  in  the  sample  exposed  to  t  out  of  ki  insertions 
in  magazine  1,  j  out  of  k 2  insertions  in  magazine  2  and  k  out  of  k3  insertions  in  magazine 
3.  Constraints  on  the  parameters  in  (7)  are  given  in  Fienberg  (1977). 

Let  1  ijic  =  the  number  of  people  in  the  sample  exposed  to  i,j  and  k  insertions  in 
magazines  1,  2  and  3  respectively. 

The  loglinear  model  with  no  second-order  interaction  has  ul23(iyjt)  =  0  for  all  i,j\k 
in  (7).  We  tested  the  validity  of  the  no  second-order  interaction  assumption  on  220  com¬ 
binations  of  three  magazines  with  one  insertion  in  each  magazine.  Using  an  asymptotic 
chi-squared  test  given  by  Feinberg  (1977  33)  we  found  only  15  of  the  220  combinations 

had  significant  second-order  interaction  at  the  5  %  level  of  significance.  Since  we  have  a 
large  sample  size  of  5201  the  asymptotic  chi-sqnared  test  is  powerful  which  means  we  can 
be  confident  that  the  assumption  of  no  second-order  interaction  is  reasonable  for  most  of 
our  data. 

There  is  no  closed-form  solution  for  the  MLEs  of  but  Fienberg  (1977),  provides 

an  iterative  solution,  performing  the  following  steps  to  obtain  the  MLEs  of 


Step  1:  set  =  1  for  all  t,j,k,  then  for  u  =  0, 

Step  2:  , 

'J  + 

c*a  o  *•  (3^+2)  Xi. a. fa  *  (3i/  +  l) 

Step  2:  mj  -fc  =  ftflomly* 

»  +  fc 

SteD  4-  rn^3t,+3^  —  ?  +  ’*  ,  r n^3l/+2^ 

oiep  4.  m,.Vfc  —  (iV+s)  miyfc  > 

m+,fc 

where  x;J+  =  xyyk  etc. 

Now  repeat  steps  2  to  4  for  t/  =  1,2, . . .  until  the  change  in  m^y*  's  from  one  cycle  to 
the  next  is  small.  This  technique  is  called  iterative  proportional  fitting  and  was  introduced 
by  Deming  and  Stephan  (1940).  We  have  already  used  this  technique  for  the  two-magazine 
model. 

It  can  be  seen  from  steps  2  to  4  above  that  the  only  data  required  to  fit  the  loglinear 
model  with  no  second-order  interaction  is  {xiy+},  {xi+k}  and  {x+y*}.  In  a  typical  situation 
for  a  loglinear  fit  the  {x^y*}  come  from  the  sample  (Fienberg  1977).  It  is  possible  to  get 
{x.yfc}  from  the  sample  data  for  some  values  of  i,j,  and  k  (c.f.  Q1  and  Q2  below) .  However, 
if  we  knew  {x»yfc}  for  all  i,j  and  k  there  would  be  no  need  to  estimate  the  e.d. 

Since  we  only  need  to  know  {x»y+},  {xj+k}  and  {x+yk}  the  sensible  thing  to  do  is 
obtain  accurate  estimates  of  these  three  sets  of  bivariate  statistical-frequencies.  Hence, 
our  method  is  to  pretend  that  our  two-magazine  bivariate  e.d.  estimates  obtained  from 
(3)  (and  marginal  adjustment)  are  the  observed  values  in  steps  2  to  4  above.  Note  that 
we  must  multiply  the  probability  estimates  by  n  to  get  the  “observed  values” . 

Once  we  have  {rhijk}  we  can  estimate  the  mass  function  of  the  three  dimensional 
e.d.  (with  no  second-order  interaction).  Now  let  X  —  ]T)3_ x  X}  be  the  total  number  of 
exposures  an  individual  has  to  a  media  schedule.  Then  the  mass  function  of  X  (denoted 
f{X))  is  estimated  by 


HX  =  Z)  =  n  ™ijk 


—  0,1,...,  kj. 


An  efficient  algorithm  was  written  to  compute  the  {m^y*},  then  construct  (9).  It  is 


clear  how  this  procedure,  with  the  assumption  of  no  second-order  or  higher  interaction 
generalises  to  m  magazines  (m  >  3). 

Danaher  (1987)  proved  that  the  estimates  of  the  e.d.  probabilities  arising  from  this 
loglinear  model  are  BAN. 

We  can  think  of  the  two-magazine  DMD  model  above  as  being  a  “saturated”  loglinear 
model  (Fienberg  1977).  Hence,  from  now  on  we  will  consider  that  we  have  two  models, 
the  MBBD  for  one  magazine  and  a  loglinear  model  for  two  or  more  magazines. 

Empirical  Tests 

The  Available  Data 

To  fit  and  test  the  accuracy  of  our  e.d.  model  we  used  the  AGB:McNair  Surveys  New 
Zealand  Ltd.  “National  Media  Survey”  data  from  a  sample  of  n  =  5201  residents  of  New 
Zealand,  10  years  or  older.  This  multistage  cluster  sample  covered  the  period  July  through 
December  1985,  with  about  200  people  interviewed  each  week.  It  is  assumed  that  reading 
habits  are  stationary  over  time  so  that  we  may  think  of  all  5201  people  as  a  sample  from 
a  single  population.  Many  questions  were  asked  of  the  respondents  although  the  only  two 
question  formats  relevant  to  us  were  (for  weekly  magazines), 

Ql)  “Have  you  personally  read  or  looked  into  any  issue  of  ...(magazine  name)  in  the 
last  seven  days  -  it  doesn’t  matter  where?”  (Has  a  Y/N  answer) 

Q2)  “How  many  different  issues  of  ...(magazine  name)  do  you  personally  read  or  look 
into  in  an  average  month  -  it  doesn’t  matter  where?”  (Has  answer  0,1, 2, 3, 4  issues) 

Table  1  shows  how  Ql  and  Q2  are  modified  for  monthly,  two-weekly,  and  two-monthly 
magazines.  Ql  and  Q2  were  asked  for  40  different  magazines.  There  were  ten  general 
appeal  magazines,  eight  business  magazines,  six  were  women’s  magazines,  six  were  news 
magazines,  five  were  hobbies  magazines  and  five  were  sports  oriented.  The  magazines 
ranged  in  single  issue  reach  from  0.7%  to  39%  of  New  Zealand’s  population. 

We  extract  univariate  exposure  data  from  Q2  and  bivariate  exposure  data  from  Ql, 


Table  1:  Question  Inserts  for  Different  Magazine  Periods. 


Magazine 

Ql  period 

Q2  period 

Weekly 

last  seven  days 

average  month 

Two- weekly 

last  two  weeks 

last  three  months 

Monthly 

last  month 

last  six  months 

Two-monthly 

last  two  months 

last  twelve  months 

to  fit  the  models  above.  The  Simmon’s  data  used  by  Leckenby  and  Kishi  (1984)  allowed 
them  to  test  their  model  only  for  schedules  with  up  to  two  insertions  in  each  magazine. 
In  contrast,  we  can  test  our  models  at  insertion  levels  ranging  from  1  (using  Ql),  4  (for 
weekly  magazines,  using  Q2)  and  6  (for  nonweeklies,  using  Q2).  We  can  also  test  our 
models  for  schedules  with  unequal  insertions  by  using  a  mixture  of  Ql  and  Q2  to  get  the 
observed  e.d.  This  should  permit  a  thorough  test  of  our  models. 

Test  Design 

Since  our  one-magazine  model,  the  MBBD,  is  different  from  the  loglinear  model  for 
two  or  more  magazines  we  will  test  this  model  separately.  The  test  is  exhaustive  in  that 
the  MBBD  model  is  fitted  then  used  to  predict  observed  sample  e.d.s  for  all  40  magazines 
at  two  insertion  levels,  k  =  1  and  k  =  4  (for  weeklies)  or  k  =  6  (for  nonweeklies) ,  making 
80  total  schedules.  We  chose  the  BBD  as  the  best  currently  available  model  to  compare 
with  MBBD. 

For  schedules  with  two  or  more  magazines  we  constructed  a  completely  randomized 
design,  selecting  340  schedules  ranging  in  size  from  2  to  6  vehicles.  The  schedules  were 
chosen  so  that  half  of  them  had  equal  insertions.  Of  the  schedules  with  equal  insertions 
half  had  one  insertion  in  each  magazine  while  the  other  half  had  4  or  6  insertions  in  each 
magazine.  For  the  schedules  with  equal  insertions  we  also  fitted  Leckenby  and  Kishi’s 
(1984)  “DMDl”  model  (renamed  DMDLK)  as  this  model  performed  the  best  out  of  the 
eight  models  they  tested. 

In  the  case  of  the  two-magazine  schedules,  with  one  insertion  in  each  magazine,  both 
the  loglinear  and  DMDLK  models  reproduce  the  sample  data  exactly,  thereby  giving  no 


prediction  error.  Owing  to  this,  our  equal-insertion  schedules  for  two  magazines  have  only 
insertion  levels  4  or  6. 


Definition  of  errors 

The  most  common  technique  for  assessing  model  accuracy  is  to  use  the  model  to 
estimate  the  e.d.  for  a  schedule  whose  observed  sample  e.d.  is  already  known  (Liebman 
and  Lee  1974;  Chandon  1976;  Leckenby  and  Kishi  1982a,  1984;  Rust  and  Leone  1984). 

A 

Denote  /»  =  f(X  =  *)  and  /,,  t  =  0,1,..., k,  respectively,  as  the  observed  and 
estimated  probabilities  of  the  e.d.  Reach  and  effective  reach  are  denoted  p  and  pe.  Four 
measurements  of  error  will  be  used.  They  are, 

i)  Mean  squared  error  ( MSE )  where 

USE  =  £(/,  -  Uf  ; 

isO 

ii)  Relative  error  in  reach  ( RER )  where 

RER  =  1^°  ~~  -fol  =  \P~  P\. 

1  -  fo  P 


iii)  Error  in  the  exposure  probabilities  over  schedule  reach  ( EPOR )  where 


EPOR 


EL  I U  -  h\ , 

l-/o  ’ 


iv)  Absolute  error  in  effective  reach  ( AEER )  where 


AEER  =  | pe  -  pe |  . 


The  MSE  is  a  popular  measure  of  error  in  statistics  whilst  the  other  three  measures 
of  error  are  designed  more  specifically  for  magazine  exposure  models.  RER  and  EPOR 
h.'  e  been  used  by  Liebman  and  Lee  (1974)  and  Leckenby  and  Kishi  (1982a,  1984).  Notice 
that  (RER  +  EPOR)(  1  -  /0)  =  Efso  I /*  “  /*l  »  which  is  the  sum  of  the  absolute  errors. 


Thus  EPOR  may  be  thought  of  as  the  contribution  to  the  sum  of  absolute  errors  which 
is  attributable  to  nonzero  exposure  levels. 


Results  and  Discussion 

The  average  errors  for  the  four  error  measures  are  given  in  Table  2.  The  MBBD  has 
smaller  average  error  than  the  BBD  for  all  but  the  RER.  A  one-way  analysis  of  variance 
showed  that  the  difference  in  the  average  errors  between  the  two  models  is  significant  (p- 
value<  0.025)  for  the  EPOR  and  AEER.  That  is,  the  MBBD  gives  significantly  smaller 
errors  than  the  BBD  for  half  the  error  measures. 

Insert  Table  2  About  Here 

Table  2  also  shows  the  variation  of  the  average  errors  for  the  loglinear  and  DMDLK 
models  across  schedule  sizes  ranging  from  2  to  6.  The  loglinear  model  gives  smaller  average 
errors  than  the  DMDLK  for  all  the  error  measures  and  schedule  sizes.  In  addition,  the 
loglinear  model’s  errors  do  not  increase  in  magnitude  as  the  schedule  size  increases  from  3 
to  6,  as  occurs  for  the  DMDLK.  The  overall  average  for  the  entire  340  schedules  is  given 
in  the  last  row  of  Table  2  and  in  Table  3  these  averages  are  compared  by  using  a  one-way 
analysis  of  variance.  The  result  of  the  analysis  of  variance  F-tests  is  that  the  loglinear 
model  gives  significantly  smaller  (p-value<  0.001)  errors  than  the  DMDLK  for  all  four 
error  measures. 

Insert  Table  3  About  Here 

We  were  able  to  test  the  loglinear  model  for  schedules  with  unequal  insertions.  The 
magnitude  of  the  errors  for  these  schedules  did  not  differ  significantly  from  the  errors  for 
schedules  with  unequal  insertions. 

Generally,  the  errors  for  schedules  having  one  insertion  in  each  magazine  were  smaller, 
by  a  factor  of  at  least  10,  than  for  schedules  with  unequal  or  high,  equal,  insertions. 
Fc  this  reason  the  average  errors  for  the  two-magazine  schedules  are  unexpectedly  high 
(being  larger  than  for  three-magazine  schedules,  for  instance).  This  is  because  all  the 


two-magazine  schedules  had  four  or  more  insertions  in  each  magazine,  as  explained  above. 

One  drawback  of  the  loglinear  model  is  that  it’s  computation  time  increases  as  sched¬ 
ule  size  and  insertion  levels  increase.  For  example,  schedules  comprised  of  three  and  six 
magazines,  with  each  magazine  having  four  insertions,  took  0.1  and  201  CPU  seconds, 
respectively,  on  a  CYBER  760  mainframe.  However,  the  increased  computation  time  is 
not  felt  to  be  a  major  shortcoming  since,  in  practice,  few  schedules  exceed  six  magazines 
(Stroeven,  Managing  Director  of  AGB:McNair  Suveys  N.Z.  Ltd.,  personal  communication). 
Danaher  (1987)  discusses  ways  in  which  the  computation  time  for  very  large  schedules  can 
be  significantly  reduced,  at  the  expense  of  accuracy. 

Optimal  Media  Scheduling 

Now  that  we  have  excellent  models  for  estimating  the  e.d.  we  can  use  these  models  to 
answer  the  following  question,  “How  many  insertions  should  be  placed  in  each  magazine 
to  maximize  reach  or  effective  reach  whilst  keeping  within  a  predetermined  budget?” 

Suppose  we  have  m  magazines  with  kj  insertions  in  each  magazine,  at  a  cost  of  c}  for 
each  insertion,  j —  1,2, . . .  ,m.  Let  f(X  =  x)  denote  the  e.d.  mass  function  as  before  and 
let  the  total  allowable  campaign  cost  be  C.  A  formal  statement  of  the  integer  programming 
optimization  problems  is  to  vary  •  •  •  ,km)  so  as  to 


Maximize  p  —  1  —  f(X  =  0)  (reach), 
m  (q  |  cj  Q  ) 

or  Pc  =  ^  kj — - - 'jr*~  (effective  reach  under  the  MBBD  model), 

j—i  ai  +  Pi 

m 

subject  to  kjCj  <  C,  k3-  =  0, 1 . . .  <  oo,  j  =  1, . . . ,  m. 


Most  integer  programming  techniques  are  suited  only  to  linear  objective  functions 
(Garfinkel  and  Nemhauser  1972).  We  can  see  that  pe  is  linear  in  kj,  but  p  is  not,  so 
w<~  need  to  use  some  nonstandard  techniques  to  maximize  p  subject  to  the  constraint. 
Solution  algorithms  for  this  optimization  problem  by  the  branch  and  bound  (Garfinkel 


,n  ><*  «t|  >»i  A»t  . 


3 


and  Nemhauser  1972)  and  dynamic  programming  (Denardo  1982)  methods  are  available. 
A  feasible  solution  is  an  m-tuple  (&i,...,&m)  which  satisfies  the  above  constaint.  We 
devised  an  ad  hoc  solution  method  which  considers  the  entire  solution  space,  defined  by 
the  constraint  kjCj  <  C,  searching  for  the  feasible  solution  with  the  highest  reach 

or  effective  reach.  However,  we  can  eliminate  many  of  the  feasible  solutions  by  exploiting 
the  following  monotonicity  property  of  reach. 

Denote  by  p(ki, . . . ,  km)  the  reach  achieved  by  placing  kj  insertions  in  the  jth  maga¬ 
zine.  If  we  add  one  insertion  to  the  Ith  magazine  then 

p{ki  . .  • ,  ki  -k 1, . . . ,  km)  >  p(k\  •  •  • » kit . . . ,  km)  ,  /  =  1, . . . ,  m,  iy  ^  0,  j  =  1, . . . ,  tn. 

In  words,  this  inequality  says  that  adding  an  insertion  to  any  magazine  is  guaranteed  to 
increase  the  reach. 

This  componentwise  monotonicity  property  of  reach  is  very  important  since  it  greatly 

reduces  the  computation  needed  to  solve  the  reach  optimization  problem.  For  instance,  we 

know  p(2, 1, 1)  >  p(  1, 1, 1)  so  if  (2,1,1)  is  a  feasible  solution  there  is  no  need  to  find  p{  1, 1, 1) 

((1,1,1)  is  clearly  also  a  feasible  solution)  since  we  know  it  to  be  less  than  p( 2, 1, 1). 

A  formal  statement  of  the  algorithm  is: 

kj  ^  =  [C/Cj\ ,  j  —  1, . . .  ,  m,  Pmax  =  0» 

Order  the  magazines  so  that  C\  <  ci  <  •  •  •  <  cm  then, 

for  j= 1  to  m 

for  t  =  kj  to  0  step  —1 

if  |  kjCj  £  C  then  Pmax  *  P{kl  •  •  •  i^m)} 

update  optimal  solution  if  pmax  changes 

ki  «-  A:[0) 

kj+i  kj+i  —  1 
begin  j  loop  over  again 
end  if 

ei  Ji  t  loop 
end  j  loop. 


Effective  reach  also  has  the  componentwise  monotonicity  property  so  the  same  algo¬ 
rithm  above  can  be  used  to  find  the  optimal  schedule  when  maximizing  effective  reach. 

Table  4  gives  the  cost  and  readership  information  for  a  group  of  three  magazines  as  well 
as  the  optimal  solution  for  reach  and  effective  reach  with  a  campaign  cost,  C ,  of  $30,000. 
The  effective  reach  is  maximized  by  putting  all  of  the  insertions  in  the  magazine  with  the 
highest  ratio  of  single  issue  reach  to  cost  per  insertion,  whilst  the  reach  is  maximized  by 
placing  as  many  insertions  as  possible  in  the  most  popular  magazine.  There  were  102 
feasible  solutions  to  this  particular  optimization  problem  of  which  82  were  eliminated  by 
the  the  monotonicity  property  of  reach  and  effective  reach,  an  80%  reduction  in  the  number 
of  schedules  to  be  tested.  Without  using  the  monotonicity  property,  the  execution  times 
for  maximizing  p  and  pe  are  24.4  and  0.072  CPU  seconds,  respectively,  on  a  CYBER  760. 
If  the  monotonicity  property  is  used  the  corresponding  execution  times  are  4.8  and  0.061 
CPU  seconds.  These  represent,  respectively,  an  80%  and  15%  reduction  in  computer  time. 
Clearly,  we  have  made  a  major  saving  in  computation  time  when  maximizing  p  and  a 
smaller,  though  significant,  saving  when  maximizing  Re¬ 
insert  Table  4  About  Here 

Conclusion 

We  used  a  simple  generalization  of  the  BBD  suggested  by  Chandon  (1976),  viz.  the 
MBBD,  with  our  efficient  parameter  estimation  and  empirically  showed  that  it  performs 
better  that  the  BBD. 

The  major  contribution  of  this  study  is  the  application  of  a  loglinear  model  to  estimat¬ 
ing  multidimensional  e.d.s.  Leckenby  and  Kishi  (1984)  showed  that  exposure  models  based 
on  the  DMD  are  accurate  in  estimating  magazine  e.d.s.  However,  a  loglinear  model,  with 
no  second-order  or  higher  interaction,  is  shown  to  be  statistically  significantly  better  than 
L'  kenby  and  Kishi’s  most  accurate  DMD  model.  We  also  found  that  their  model  loses 
accuracy  as  the  schedule  size  increases  whereas  the  loglinear  model  sustains  it’s  accuracy 


over  a  range  of  schedule  s*.  °  In  addition,  the  DMDLK  is  not  designed  for  schedules  with 
unequal  insertions.  In  contrast,  the  loglinear  model  handles  unequal  insertions,  giving  the 
same  accuracy  as  for  schedules  with  equal  insertions. 

The  trade-off  for  the  increased  accuracy  of  the  loglinear  model  is  an  increase  in  com¬ 
putation  time.  This  is  felt  to  be  a  tolerable  shortcoming  considering  the  significant  im¬ 
provement  the  loglinear  model  makes  over  the  best  currently  known  model. 


Technical  Appendix 


Let  rii  be  the  number  of  people  in  the  sample  exposed  to  t  out  of  k  insertions  in  a 
magazine.  Denote  the  likelihood  function  for  the  MBBD  as  L  and  define 


c  =  c{a,0) 


r(a  +  Jfc)  r(a  +  0) 
r(a)  r(a +  /?  +  *) 


and  =  5^y=o  1/(t  +  j).  Then  the  likelihood  equations  are 

=  ^2  Ai («.*)»<  -  («  ~  n«.)Ai(a  +  >3, Ac) 

»=1 

(1  -  tu)c[A1(a,k)  -  Ai(a  +  /9,fe)|nfc 
a;  4-  (l  —  u/)c  ’ 

=J^A1(/?,k-t)n»  -(n-nfc)Ai(a  +  /?,A:) 

9/3  f^o 

_  (1  -  a;)cAi(tt  4-  0,k)rik 
w  +  (l-  u>)c  ’ 
dlogL  —  (n  —  nfc)  (1  -  c)nk 
du>  1  —  u>  w  +  (1  —  u/)c 


We  calculate  Cj  by  equating  9  ‘ J**  h  to  zero  and  get  u>  =  • 

When  u>  is  substituted  into  the  first  order  partial  derivatives  of  a  and  0  the  following 
second  order  partial  derivatives  result, 


£^o|L  =  (»»*}£  _  a  +  0'k))2m  _  c)  +  Aj(a  +  0,  k)  -  A,(a,*)| 

aor  1  —  c 

k-i 

-  A2(a,t')ni  +  (n  -  nfc)A2(a  +  0,k)y 

»= i 

3^1  Jn  ~nt)c  +  „,*)_  A,(a,*))  A,(o  +  /J,*)/(l  -  c)  +  A2(a  +  0,k)] 

d0oa  1  —  c 

+  (n  -  nfc)A2(a  +  0,k), 

a^oft  =(B-_"^[(A1(a  +  /».t))V(l  -  c)  +  Aj(o  +  /?,*)] 

k—  1 

-  ^2  A2(/?,n  -  *)n<  -f  (n  —  nfc)A2(a  +  0,k), 


where  =  —  ^Ai (*7,/).  The  MBBD  likelihood  equations  have  no  closed  form 

solution  but  may  be  solved  by  the  Newton-Raphson  method  where  the  above  second  order 
partial  derivatives  are  utilized.  Since  Cj  is  am  explicit  function  of  a  and  /?  the  numerical 
work  is  considerably  reduced. 


Table  2:  Average  Errors  for  Schedules  of  size  1  to  6  vehicles:  Comparison  Between 
MBBD  and  BBD  Models  and  Between  Loglinear  and  DMDLK  Models 


Schedule 

Size 

Model 

MSE  x  106 

Error  Type 

EPOR%  RER% 

AEER% 

1 

MBBD 

3.13 

11.70 

3.63 

0.33 

BBD 

5.20 

15.24 

3.20 

0.90 

2 

Loglinear 

4.74 

14.71 

1.98 

1.04 

DMDLK 

9.23 

24.22 

5.23 

3.75 

3 

Loglinear 

2.90 

11.34 

1.32 

0.79 

DMDLK 

4.90 

13.48 

2.24 

2.68 

4 

Loglinear 

6.98 

13.89 

1.74 

1.91 

DMDLK 

8.89 

15.51 

2.14 

3.28 

5 

Loglinear 

5.54 

11.01 

1.01 

0.93 

DMDLK 

18.69 

18.12 

2.30 

4.08 

6 

Loglinear 

4.98 

11.17 

1.85 

1.06 

DMDLK 

21.97 

19.35 

2.12 

6.47 

Overall 

Loglinear 

4.55 

12.95 

1.57 

1.10 

for  sizes  2-6 

DMDLK 

8.94 

18.03 

3.20 

3.46 

8.94 


18.03 


3.20 


3.46 
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Table  3:  Analysis  of  Variance  Comparison  of  DMDLK  and  Loglinear  Models 


Error 

Type 

Source 

d.f. 

Mean 

Square 

F 

MSE 

Between  models 

1 

2.13  x  10"7 

16.2° 

Error 

502 

1.31  x  10"8 

EPOR 

Between  models 

1 

0.286 

27.2° 

Error 

502 

0.0105 

RER 

Between  models 

1 

0.0294 

38.1“ 

Error 

502 

0.00077 

AEER 

Between  models 

1 

0.0616 

50.7° 

Error 

502 

0.00121 

ap-value  <  0.001. 
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Table  4:  Optimal  Solution  for  a  Three  Magazine  Schedule. 


Magazine 

Cost  per 
Insertion  $ 

Single  Issue 
Reach  % 

Optimal  Soln 
for  p 

Optimal  Soln 
for  pe 

Readers  Digest 

2400 

25.8 

0 

12 

Womans  Weekly 

5602 

37.9 

1 

0 

N.Z.  Listener 

6061 

39.0 

4 
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